Survey on Deep Multi-modal Data Analytics: Collaboration, Rivalry, and Fusion


Abstract

With the development of web technology, multi-modal or multi-view data has surged as a major stream for big data, where each modal/view encodes an individual property of the data objects. Often, different modalities are complementary to each other. This fact has motivated a lot of research attention on fusing the multi-modal feature spaces to comprehensively characterize the data objects. Most existing state-of-the-art methods have focused on how to fuse the energy or information from multi-modal spaces to deliver superior performance over their single-modal counterparts. Recently, deep neural networks have been shown to be a powerful architecture for capturing the nonlinear distribution of high-dimensional multimedia data, and naturally the same holds for multi-modal data. Substantial empirical studies have been carried out to demonstrate the advantages of deep multi-modal methods, which can essentially deepen the fusion of multi-modal deep feature spaces. In this article, we provide a substantial overview of the existing state of the art in the field of multi-modal data analytics, from shallow to deep spaces. Throughout this survey, we further indicate that the critical components for this field are collaboration, adversarial competition, and fusion over multi-modal spaces. Finally, we share our viewpoints regarding some future directions in this field.


Similar resources

Soft multi-modal data fusion

Clustering groups items together that are most similar to each other and sets those that are least similar into different clusters. Methods have been developed to cluster records in a data set that are of only qualitative or quantitative data. Data sets exist that contain a mix of qualitative (nominal and ordinal) and quantitative (discrete and continuous) data. Clustering records of mixed kind...


Multi-modal Data Fusion Techniques and Applications

In recent years, camera networks have been widely employed in several application domains such as surveillance, ambient intelligence or video conferencing. The integration of heterogeneous sensors can provide complementary and redundant information that fused to visual cues allows the system to obtain an enriched and more robust scene interpretation. A discussion about possible architectures an...


Multi-modal Data Fusion: A Description

Clustering groups records that are similar to each other into the same group, and those that are less similar into different groups. Clustering data of mixed types is difficult due to different data characteristics. Extending Gower’s metric for nominal and ordinal data is incorporated into an agglomerative hierarchical clustering algorithm to cluster mixed type data. This paper describes the ex...


M3Fusion: A Deep Learning Architecture for Multi-{Scale/Modal/Temporal} satellite data fusion

Modern Earth Observation systems provide sensing data at different temporal and spatial resolutions. Among optical sensors, today the Sentinel-2 program supplies high-resolution temporal (every 5 days) and high spatial resolution (10m) images that can be useful to monitor land cover dynamics. On the other hand, Very High Spatial Resolution images (VHSR) are still an essential tool to figure out...


Miscommunication in Multi-modal Collaboration

We explore grounding and the sub-phenomena of miscommunication and repair from both theoretical and empirical perspectives. From a theoretical perspective, we classify several types of miscommunication, as action or perception failure, and part of a more general case of non-alignment of the mental states of agents. From an empirical perspective, we present a preliminary analysis of examples of ...



Journal

Journal title: ACM Transactions on Multimedia Computing, Communications, and Applications

Year: 2021

ISSN: 1551-6857, 1551-6865

DOI: https://doi.org/10.1145/3408317